Picture for Yuji Wang

Yuji Wang

Spatial-Temporal Decoupled Reference Conditioning for Identity-Preserving Text-to-Video Generation

Add code
Jun 01, 2026
Viaarxiv icon

SAFE-Pruner: Semantic Attention-Guided Future-Aware Token Pruning for Efficient Vision-Language-Action Manipulation

Add code
May 28, 2026
Viaarxiv icon

Segment Anything with Motion, Geometry, and Semantic Adaptation for Complex Nonlinear Visual Object Tracking

Add code
May 21, 2026
Viaarxiv icon

TAIHRI: Task-Aware 3D Human Keypoints Localization for Close-Range Human-Robot Interaction

Add code
Apr 10, 2026
Viaarxiv icon

Rethinking IRSTD: Single-Point Supervision Guided Encoder-only Framework is Enough for Infrared Small Target Detection

Add code
Apr 07, 2026
Viaarxiv icon

Embed-RL: Reinforcement Learning for Reasoning-Driven Multimodal Embeddings

Add code
Feb 14, 2026
Viaarxiv icon

Beyond Max Tokens: Stealthy Resource Amplification via Tool Calling Chains in LLM Agents

Add code
Jan 16, 2026
Viaarxiv icon

DDAVS: Disentangled Audio Semantics and Delayed Bidirectional Alignment for Audio-Visual Segmentation

Add code
Dec 23, 2025
Viaarxiv icon

IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis

Add code
Mar 02, 2025
Viaarxiv icon

ARMOR: Shielding Unlearnable Examples against Data Augmentation

Add code
Jan 15, 2025
Viaarxiv icon